A Note to the Reader

نویسنده

  • Peter Andreasen
چکیده

This paper is organized in three almost self contained sections. The first section is a popular discussion on the number of crosswords based on a remark of Shannon made in [Shannon, 1948]. Although never published, the ’back-of-the-envelope’ calculation we present is believed to be the same argument Shannon had in mind when he wrote the passage on crosswords. The second section presents the Hausdorff dimension and the box-counting dimension and gives an introduction to the theory of self similar sets. Methods for calculating the Hausdorff dimension of self similar sets are discussed; in particular we prove Hutchinson’s formula for the dimension of sets represented by iterated function systems (IFS). The third and last section describes how the topics of the two previous sections are in fact connected. Indeed, we argue that the problem of calculating the number of crosswords is related to finding the Hausdorff dimension of certain self similar sets. In addition we identify a connection between box counting dimension and Hartley entropy. I ON THE NUMBER OF CROSSWORDS The American mathematician Claude E. Shannon is widely recognized as the father of information theory. His most famous paper is “A Mathematical Theory of Communication” from 1948 in which he lays the foundation for the modern information theory, and even today the paper shows an impressive combination of clarity and vision. We quote a few passages from the end of section 7, wherein we find a curious reference to suchmundanematters as crossword puzzles. . . The ratio of the entropy of a source to the maximum value it could have while still restricted to the same symbols will be called its relative entropy. This, as will appear later, is the maximum compression possible whenwe encode into the same alphabet. Oneminus the relative entropy is the redundancy. The redundancy of ordinary English, not considering statistical structure over greater distances than about eight letters, is roughly 50%. This means that when we write English half of what we write is determined by the structure of the language and half is chosen freely. The figure 50%was found by several independent methods which all gave results in this neighborhood. One is by calculation of the entropy of the approximations to English. A second method is to delete a certain fraction of the letters and then let someone attempt to restore them. If they can be restored when 50% are deleted the redundancy must be greater than 50%. A third method depends on certain known results in cryptography. Two extremes of redundancy in English prose are represented by Basic English and by James Joyce’s book Finnegans Wake. The Basic English vocabulary is limited to 850 words and the redundancy is very high. This is reflected in the expansion that occurs when a passage is translated into Basic English. Joyce on the other hand enlarges the vocabulary and is alleged to achieve a compression of semantic content.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Divining Reader: A Construct Based on the Bibliomantic Approach to Hafez’s Divan

Hafez Shirazi was a distinguished Persian poet. His poetry collection, Divan, is regarded as a literary work of profound significance. Iranians view this collection as something much more than poetry because it is also used for bibliomantic purposes. After studying Hafez in his social context and exploring distinctive qualities of his Divan, particularly its application as a divination tool, th...

متن کامل

Methods of Persuading the Reader in Golestan by Saadi

Moralistic and didactic texts make up a large part of Persian literature. Undoubtedly, if "the inculcation of a particular concept into the mind of the reader and the attempt to persuade and conquer his mind" is not the main purpose of these texts, it is definitely one of their most important goals. In this sense, the poet or writer of such texts tries to persuade the reader and sway his mind t...

متن کامل

Identification of the Features of E-reader Applications and Evaluation of Widely Used Iranian Applications

Purpose: The purpose of this study is to identify the features of e-reader applications through a systematic review of texts, and also to evaluate four widely used Iranian e-reader applications (Fidibo, Taghche, Ketabrah, ketab sabz) in terms of identified features. Method: The present redearch is an applied study in terms of purpose that was conducted on the basis of a systematic review frame...

متن کامل

Technical Note: An opportunity cost maintenance scheduling framework for a fleet of ships: A case study

The conventional method towards deriving schedule for a fleet of ships to minimize cost alone has the short-coming of not addressing the problem of operation revenue losses associated with delays during maintenance at ships dockyards. In this paper, a preventive maintenance schedule for a fleet of ships that incorporates op-portunity cost is presented. The idea is to assign a penalty cost to al...

متن کامل

Methods of Representing Implied Author and Implied Reader in GavKhuni and its Film Adaptation

Implied author and implied reader are components of a narrative structure, playing a crucial role in illuminating hidden elements within a text. Therefore, studying these elements in literary works can expose the underlying layers of narration and open up new critical outlooks to the readers. Furthermore and in cases of cinematic adaptations, it explains whether this was the novel or the film t...

متن کامل

Editorial Volume 5, Issue1

Applied Literature, however, does not have literature at its centre. Literature in this domain is a tool to solve problems and achieve goals. Using literature to teach and learn languages, the application of literature to language education, is a very handy example. Health Humanities (by Crawford, et al. and reviewed by A. Ramazani in our Journal's previous issue) comprises chapters on how lite...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001